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PRELIMINARY AMENDMENT 

Dear Sir: 

In response to the Notice to File Missing Parts dated November 30, 2001, Applicants 
submit this Preliminary Amendment to insert SEQ ID NOs into the Application. Entry of this 
Amendment before examination of the Application on the merits is respectfully requested. 
Please amend the Application as follows: 

In the specification: 

Please amend the specification as set forth below. A marked of version of the 
replacement paragraphs follows the remarks section. 
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On Page 14, please replace the 10 th complete paragraph with the following paragraph: 

In other embodiments, the weak DNA-binding domain comprises two Cys 2 His 2 (SEQ ID 
NO: 4) zinc fingers. 

On pages 18-19, please replace the paragraph bridging pages 18-19 with the following 
paragraph: 

Fig. 3. Recognition helix sequences of fingers isolated by our selection. For candidates 
that were isolated multiple times (as judged by nucleotide sequence), the number of clones 
obtained is shown in parentheses. The consensus sequence(s) of fingers selected by phage 
display for each target subsite are also shown (ref. 6, + denotes a positively charged residue, _ 
denotes no discernible preference). Asterisks indicate candidates with a 2 bp deletion 
downstream of the sequence encoding the recognition helix. Arrows illustrate a few of the most 
plausible potential base contacts. (SEQ ID NOS 23-91, respectively, in order of appearance). 

On page 19, please replace the sixth (6 th ) complete paragraph with the following 
paragraph: 

Fig. 9. This figure shows the results of a certain embodiment of the subject interaction 
trap assay wherein a DNA-sequence can be selected which interacts with a specific protein. 
(SEQ ID NOS 15-22, respectively, in order of appearance). 

On page 19, please replace the seventh (7 th ) complete paragraph with the following 
paragraph: 

Fig. 10. This figure shows a certain embodiment of the subject ITS wherein a reporter 
gene under the control of a basal promoter is used to identify false positives arising from 
amplification of the reporter gene construct. (SEQ ID NO: 92). 
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On page 19, please replace the eighth (8 th ) complete paragraph with the following 
paragraph: 

Fig. 11. This figure shows a certain embodiment of the subject ITS wherein protein 
dimerization may be detected. (SEQ ID NO: 93). 

On page 19, please replace the ninth (9 th ) complete paragraph with the following 
paragraph: 

Fig. 12. This figure shows another embodiment of the subject ITS wherein protein 
dimerization may be detected. (SEQ ID NO: 94). 

On page 20, please replace the first (1 st ) complete paragraph with the following paragraph: 

Fig. 14. This figure shows the result of a certain embodiment of the ITS for isolation of 
a novel DNA binding domain from a library of random polypeptides wherein the polypeptide 
does not bind to the promoter region of either reporter gene. (SEQ ID NOS: 95-97, respectively, 
in order of appearance). 

On page 20, please replace the second (2 nd ) complete paragraph with the following 
paragraph: 

Fig. 15. This figure shows the result of a certain embodiment of the ITS for isolation of 
a novel DNA binding domain from a library of random polypeptides wherein the polypeptide 
non-specifically binds to the promoter region of both reporter genes. (SEQ ID NOS: 95-97, 
respectively, in order of appearance). 

On page 20, please replace the third (3 rd ) complete paragraph with the following 
paragraph: 

Fig. 16. This figure shows the result of a certain embodiment of the ITS for isolation of a 
novel DNA binding domain from a library of random polypeptides wherein the polypeptide 
specifically binds to the promoter region of one of the reporter genes. (SEQ ID NOS: 95-97, 
respectively, in order of appearance). 



20/489678.1 



-3- 



U.S.S.N. 09/990,762 
Joung et al. 

On page 20, please replace the fourth (4 th ) complete paragraph with the following 
paragraph: 

Fig. 17. This figure shows the alternative result the ITS embodiment shown in Fig, 16, 
wherein the polypeptide specifically binds to the promoter region of the other reporter gene. 
(SEQ ID NOS: 95-97, respectively, in order of appearance). 

On pages 32-33, please replace the paragraph bridging pages 32-33 with the following 
paragraph: 

The term "zinc finger protein" or "ZFPs" or "zinc finger polypeptide" refers to proteins 
that bind to DNA, RNA and/or protein, in a sequence-specific manner, by virtue of a metal 
stabilized domain known as a zinc finger. See, for example, Miller et al (1985) EMBO J. 
4:1609-1614; Rhodes et al (1993) Scl Amer. Feb:56-65; and Klug (1999) 1 Mol Biol 
293:215-218. The most widely represented class of ZFPs, known as the C 2 H 2 (SEQ ID NO: 4) 
ZFPs, comprises proteins that are composed of zinc fingers that contain two conserved cysteine 
residues and two conserved histidine residues. Over 10,000 C 2 H 2 (SEQ ID NO: 4) zinc fingers 
have been identified in several thousand known or putative transcription factors. Each C 2 H 2 
(SEQ ID NO: 4) zinc finger domain comprises a conserved sequence of approximately 30 amino 
acids that contains the invariant cysteines and histidines in the following arrangement: -Cys-(X) 2 „ 
4 -Cys-(X)i 2 -ffis-(X)3_5-His (SEQ ID NO: 1). In animal genomes, polynucleotide sequences 
encoding this conserved amino acid sequence motif are usually found as a series of tandem 
duplications, leading to the formation of multi-finger domains within a particular transcription 
factor. As used herein, "zinc finger protein" refers to known zinc finger proteins, or fragments 
thereof, or to novel polypeptides isolated by the methods of the invention. 

On page 63, please replace the 3 rd complete paragraph with the following paragraph: 

A second class comprises proteins in which the DNA-binding domain is comprised of 
multiple reiterated modules that cooperate to achieve high-affinity binding of DNA. An example 
is the Cys 2 His 2 (SEQ ID NO: 4) class of zinc-finger proteins, which typically contain a tandem 
array of from two or three to dozens of zinc-finger modules. Each module contains an alpha- 
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helix capable of contacting a three to five base-pair stretch of DNA. Typically, at least three 
zinc-fingers are required for high-affinity DNA binding. Therefore, one or two zinc-fingers 
constitute a low-affinity DNA-binding domain with suitable properties for use as a component in 
this invention. Examples of proteins of the C2H2 class include TFIIIA, Zif268, Gli, and SRE- 
ZBP. (These and other proteins and DNA sequences referred to herein are well known in the art. 
Their sources and sequences are known.) 

On page 78, please replace the second (2 nd ) complete paragraph with the following 
paragraph: 

In this report we describe the design and testing of an E.coli-based selection method that 
can detect either protein-DNA or protein-protein interactions and that can handle libraries larger 
than 10 8 in size. We tested our new method by selecting Cys 2 His 2 (SEQ ID NO: 4) zinc finger 
variants similar to those previously isolated by phage display (6, 13). The results of our selection, 
the rapidity of our method, and the versatility of the underlying transcriptional activation scheme 
suggest that this bacterial-based system should provide a useful tool for identifying and 
characterizing protein-DNA and protein-protein interactions. 

On pages 78-79, please replace the paragraph bridging pages 78-79 with the following 
paragraph: 

The Gall lP-Zifl23 fusion protein contains residues 263-352 of the yeast Gall IP protein 
(with a N342V mutation [14]) fused by a nine amino acid linker Ala-Ala- Ala-Pro- Arg-Val-Arg- 
Thr-Gly (SEQ ID NO: 5) to residues 327-421 of Zif268 (the region encoding the three zinc 
fingers). The phagemid pBR-GP-Z123 expresses the Gall lP-Zif!23 hybrid protein from an 
IPTG-inducible lacUVS promoter. The pBR-GP-Z12BbsI phagemid is analogous to pBR-GP- 
Z123 except that Zif finger 3 is replaced with a modified Zif finger 1 in which the sequence 
encoding residues -1 through 6 of the finger recognition helix is replaced by unrelated sequence 
(a "staffer" fragment) flanked by Bbsl restriction sites. All phagemids used in this study can be 
easily "rescued" from cells by infection with a filamentous helper phage; infectious phage 
particles produced by these cells contain single-stranded phagemid DNA. 
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On page 79, please replace the first (1 st ) complete paragraph with the following paragraph: 

The reporter construct that expresses HIS3 (PzirHISS-aadA) has the Zif268 binding site 
sequence 5 GCGTGGGCG 3 centered at base pair -63 relative to the transcription start site of a 
weak E. coli lac promoter derivative (the P W k promoter). The three selection strain reporters 
change the zinc finger binding site of Pzif HIS3-aadA, replacing the sequence 
5 TCGACAAGCGTGGGCG 3 (SEQ ID NO: 6) (bases -74 to -59 relative to the transcription 
start site) with sequences that should allow binding of the desired zinc finger variants: 
5 CAAGGGTTCAGGGGCG 3 (SEQ ID NO: 7) (for NRE), 5 GGCTATAAAAGGGGCG 3 (SEQ 
ID NO: 8) (for TATA), or 5 TGGGACATGTTGGGCG 3 (SEQ ID NO: 9) (for p53). Each of 
these reporters was transferred (by recombination) to an F episome encoding lacf 1 repressor and 
then introduced into strain KJ1C in a single step essentially as previously described (15, J.KJ. & 
C.O.P., unpublished). The resulting strains were then each transformed with the pACL-aGal4 
plasmid to create the NRE, TATA, p53, and Zif "selection strains." 

On page 82, please replace the first (1 st ) complete paragraph with the following paragraph: 

Zinc finger domains can bind DNA and activate transcription in E.coli. We tested 
our new E.coli-based system by applying it to a problem previously studied using phage display: 
the selection, from a large randomized library, of zinc finger variants with altered DNA binding 
specificities (for review, see 21). However, before proceeding with selections, we first examined 
whether a wild-type zinc finger protein could bind DNA and activate transcription in our system. 
(Relatively little information was available on the activity of Cys2His2 (SEQ ID NO: 4) zinc 
finger proteins in bacteria.) To do this, we constructed fusion proteins containing fragments of 
the yeast Gall IP and Gal4 proteins that had previously been shown to interact with each other 
(10, 14). Thus, we fused a Gall IP fragment to the three zinc fingers of the murine Zif268 protein 
(creating the Gall lP-Zifl23 protein), and we replaced the carboxy-terminal domain of the E.coli 
RNA polymerase a subunit with a Gal4 fragment (creating the chimeric ccGal4 protein). A 
Zif268 DNA binding site was positioned upstream of our P W k-HIS3-aadA operon to create the 
Pzif HIS3-aadA operon (Figure 1C), and this cassette was introduced into a AhisB E.coli strain in 
single copy to create the "Zif reporter strain." 
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On page 86, please replace the first (1 st ) complete paragraph with the paragraph set forth 
below. Please note that the underlining appearing in lines 9 and 10 of this paragraph (Le., 
NSGSHK and NHGSWK) appears in the original text 

For the NRE target subsite, an initial attempt using our new selection method yielded 
only one finger (NSGSWK) (SEQ ID NO: 10) that bound preferentially to the target sequence. 
Based on our existing knowledge of zinc finger-DNA recognition (reviewed in 21), one can 
postulate reasonable contacts between recognition helix residues of this finger and bases in the 
primary strand of the NRE subsite (Figure 3C). However, we were initially concerned by the 
relatively low frequency of fingers selected for this site, and we repeated the selection using an 
additional enrichment step in an attempt to isolate more fingers. The great majority of sequences 
isolated this way had the same amino acid sequence as the candidate originally selected 
(NSGSWK) (SEQ ID NO: 10) but two closely related sequences (NSGSHK (SEQ ID NO: 11) 
and NHGSWK (SEQ ID NO: 12)) were also identified. These results suggested that we might 
have obtained a small number of clones merely because very few candidates in our library can 
pass the threshold set in our NRE selection. 

On page 86, please replace the second (2 nd ) complete paragraph with the paragraph set 
forth below. Please note that the underlining appearing in line 1 of this paragraph (i.e., 
not) appears in the original text. 

As shown in Figure 3C, the sequences of fingers isolated in our NRE selections do not 
match the consensus sequence for fingers selected by phage display. We performed several 
experiments to explore the basis of this difference: We first checked our library by sequencing 
random candidates to ensure that there was no drastic bias in nucleotide distribution and were 
able to rule this out as a plausible explanation (unpublished data). We then decided to directly 
introduce (in exactly the same context) one of the fingers (TRTNKS) (SEQ ID NO: 13) that had 
been selected by phage display (6) and see whether it could work in our system as a Gall IP-zinc 
finger fusion protein. We find that NRE selection strain cells expressing this TRTNKS (SEQ ID 
NO: 13) finger fusion protein grow very poorly on HIS selective medium whereas the same cells 
expressing the NSGSWK (SEQ ID NO: 10) finger fusion (obtained in our selections) grow 
robustly (unpublished data). The simplest explanation for this result is that the TRTNKS (SEQ 
ID NO: 13) finger fusion binds poorly to the NRE subsite and therefore only weakly stimulates 
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HIS3 expression. This explanation is supported by our observation that earlier selections with the 
NRE subsite, using a prototype of our system in which zinc fingers were expressed from a much 
higher copy number phagemid, had yielded the TRTNKS (SEQ ID NO: 1 3) as well as the 
NSGSWK (SEQ ID NO: 10) finger (J.KJ. and C.O.P., unpublished data). This suggests that our 
current system sets a very stringent standard for the NRE selections and may account for why we 
isolated such a small number of specific candidates. 

On page 87, please replace the first (1 st ) complete paragraph with the following paragraph: 

We also used our binding site preference assay to compare the specificity of the 
NSGSWK (SEQ ID NO: 10) finger we had selected for the NRE subsite with that of the 
TRTNKS (SEQ ID NO: 13) finger selected by phage display. In our bacterial-based assays, the 
NSGSWK (SEQ ID NO: 10) finger binds specifically to the NRE subsite and binds only very 
weakly to the TATA subsite. By contrast, the TRTNKS (SEQ ID NO: 13) finger binds only 
weakly to all four subsites (exhibiting a preference for the NRE and TATA subsites over the p53 
and Zif subsites) (unpublished data). These results suggest that the NSGSWK (SEQ ID NO: 10) 
finger we selected actually binds more tightly and specifically in our system than the TRTNKS 
(SEQ ID NO: 13) finger identified earlier by phage display. 

On page 87, please replace the second (2 nd ) complete paragraph with the following 
paragraph: 

Each of our three selections also yielded a small percentage of fingers that bind non- 
specifically to all four DNA subsites tested. Surprisingly, all of these fingers match a consensus 
sequence of the form R+WL+L (SEQ ID NO: 14) (where + denotes a positively charged residue, 
Figure 3D). These fingers are rich in positive charge and may make extra phosphate contacts. 
We also note that all of these fingers have a tryptophan residue at position 2 and thus would not 
have been present in the libraries used for earlier phage display experiments. This highly 
conserved set of non-specific fingers raises many interesting questions: What level of specificity 
is required for a zinc finger protein to function in our assay (and thus to what extent does the E. 
coli chromosome function as a non-specific competitor)? How do these fingers bind? Why is this 
particular class of non-specific fingers the only type selected in our system? 
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On page 94, please replace the first (1 st ) complete paragraph with the paragraph set forth 
below. Please note that the underlining appearing in lines 5 and 7 of this paragraph (i.e., 
CXGGACACGTX and CGGGANNNNNG) appears in the original text 

In addition to selecting proteins that bind to a specific DBS, this bacterial ITS can also be 
used to select DBS's that interact with a specific protein. Figure 9 shows the results for such an 
in vivo site selection to select DNA sequences that interact with the P53 zf protein. The consensus 
binding site, as determined by Wolfe et. al., JMB 285, pl917-1934 (1999), for the 753^ protein 
is CXGG ACACGTX (SEQ ID NO: 15) where X indicates no clear sequence preference at that 
position. A library of EGFP reporter plasmids containing the partially randomized binding site 
CGGGA NNNNN G (SEQ ID NO: 16) was created (where N indicates a mixture of A, G, C, T) 
and introduced into host cells containing the <x-Gal4 and Gall lp-PSS^ fusion proteins. These 



cells were then grown to saturation at 37°C in LB media with the appropriate antibiotics and then 
100 pi of this culture was used to inoculate 10 ml of minimal media (as described in example 3) 
Sj containing 10 ng/ml aTc and 100 pM IPTG. These cultures were then incubated for 24 hours at 
p|j 30° C on a rotating drum incubator. After incubation, one round of FACS sorting was performed 

L on a Cytomation MoFlo multiple laser FACS sorter and individual EGFP positive clones were 

w 

ftj selected. Of 20 clones analyzed, 16 were EGFP positive (i.e. expressed at least 2 fold more 

r*f 

jj EGFP than control cells). These 16 positive clones contained three unique binding sites. 

Q The most abundant of these sites matched the consensus from the in vitro site selection. 

v. U 



In the claims: 

Pursuant to 37 C.F.R. §1.12 1(c)(3), replace claim 18 with claim 18 as set forth below. A 
marked-up version of claim 18 showing the changes made follows the Remarks section of this 
Response. 



18. The method of claim 16, wherein the weak DNA-binding domain comprises two 
Cys 2 His 2 (SEQ ID NO: 4) zinc fingers. 
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Remarks 



Applicants respectfully request entry of this amendment prior to examination of this case 
on the merits. The specification and claims have been amended to insert the appropriate SEQ ID 
NOs corresponding to the Sequence Listing being submitted herewith. No new matter has been 
added by virtue of this amendment. 



If there are any fees due in connection with the filing of this response, please charge the 
fees to our Deposit Account No. 06-1448. If a telephone conversation with Applicant's Agent 
would expedite prosecution of the above-identified application, the Examiner is urged to call the 
undersigned at (617) 832-1770. 

Respectfully submitted, 

FOLEY HOAG, LLP 



Conclusion 



By: 




One Post Office Square 
Boston, MA 02109 



Agent for Applicants 
Registration No. 46,778 



Telephone: (617) 832-1000 
Facsimile: (617) 832-7000 



Date: January 30. 2002 
Customer Number 25181 
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Marked-up version of replacement paragraphs of the specification showing changes made: 

On Page 14, the 10 th complete paragraph was amended as follows: 

In other embodiments, the weak DNA-binding domain comprises two Cys2His2 (SEP ID 
NO: 4) zinc fingers. 

On pages 18-19, the paragraph bridging pages 18-19 was amended as follows: 

Fig. 3. Recognition helix sequences of fingers isolated by our selection. For candidates 
that were isolated multiple times (as judged by nucleotide sequence), the number of clones 
obtained is shown in parentheses. The consensus sequence(s) of fingers selected by phage 
display for each target subsite are also shown (ref 6, + denotes a positively charged residue, _ 
denotes no discernible preference). Asterisks indicate candidates with a 2 bp deletion 
downstream of the sequence encoding the recognition helix. Arrows illustrate a few of the most 
plausible potential base contacts. (SEP ID NOS 23-91, respectively, in order of appearance). 

On page 19, the sixth (6 th ) complete paragraph was amended as follows: 

Fig. 9. This figure shows the results of a certain embodiment of the subject interaction 
trap assay wherein a DNA-sequence can be selected which interacts with a specific protein. 
(SEQ ID NOS 15-22, respectively, in order of appearance). 

On page 19, the seventh (7 th ) complete paragraph was amended as follows: 

Fig. 10. This figure shows a certain embodiment of the subject ITS wherein a reporter 
gene under the control of a basal promoter is used to identify false positives arising from 
amplification of the reporter gene [contract] construct . (SEP ID NO: 92). 

On page 19, the eighth (8 th ) complete paragraph was amended as follows: 

Fig. 11. This figure shows a certain embodiment of the subject ITS wherein protein 
dimerization may be detected. (SEQ ID NO: 93). 
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On page 19, the ninth (9 th ) complete paragraph was amended as follows: 

Fig. 12. This figure shows another embodiment of the subject ITS wherein protein 
dimerization may be detected. (SEP ID NO: 94). 

On page 20, the first (1 st ) complete paragraph was amended as follows: 

Fig. 14. This figure shows the result of a certain [emobidment] embodiment of the ITS 
for isolation of a novel DNA binding domain from a library of random polypeptides wherein the 
polypeptide does not bind to the promoter region of either reporter gene. (SEP ID NOS: 95-97, 
respectively, in order of appearance). 

£jjf On page 20, the second (2 nd ) complete paragraph was amended as follows: 

i 

,|| Fig. 15. This figure shows the result of a certain [emobidment] embodiment of the ITS 

# for isolation of a novel DNA binding domain from a library of random polypeptides wherein the 
polypeptide non-specifically binds to the promoter region of both reporter genes. (SEQ ID NOS: 
£fl 95-97, respectively, in order of appearance). 

ru 
0 

f|j On page 20, the third (3 rd ) complete paragraph was amended as follows: 

cl 

J Fig. 16. This figure shows the result of a certain [emobidment] embodiment of the ITS 

j3 for isolation of a novel DNA binding domain from a library of random polypeptides wherein the 

polypeptide specifically binds to the promoter region of one of the reporter genes. (SEP ED 

NOS: 95-97, respectively, in order of appearance). 

On page 20, the fourth (4 th ) complete paragraph was amended as follows: 

Fig. 17. This figure shows the [alterantive] alternative result the ITS embodiment shown 
in Fig. 16, wherein the polypeptide specifically binds to the promoter region of the other reporter 
gene. (SEP ID NOS: 95-97, respectively, in order of appearance). 
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On pages 32-33, the paragraph bridging pages 32-33 was amended as follows: 

The term "zinc finger protein" or "ZFPs" or "zinc finger polypeptide" refers to proteins 
that bind to DNA, RNA and/or protein, in a sequence-specific manner, by virtue of a metal 
stabilized domain known as a zinc finger. See, for example, Miller et al (1985) EMBO J. 
4:1609-1614; Rhodes et al (1993) Set Amer. Feb:56-65; and Klug (1999) J. Mol Biol 
293:215-218. The most widely represented class of ZFPs, known as the C 2 H 2 (SEP ID NO: 4) 
ZFPs, comprises proteins that are composed of zinc fingers that contain two conserved cysteine 
residues and two conserved histidine residues. Over 10,000 C2H2 (SEP ID NO: 4) zinc fingers 
have been identified in several thousand known or putative transcription factors. Each C2H2 
(SEP ID NO: 4) zinc finger domain comprises a conserved sequence of approximately 30 amino 
acids that contains the invariant cysteines and histidines in the following arrangement: -Cys-(X)2- 
4-Cys-(X)i 2 -His-(X)3_5-His (SEQ ID NO: 1). In animal genomes, polynucleotide sequences 
encoding this conserved amino acid sequence motif are usually found as a series of tandem 
duplications, leading to the formation of multi-finger domains within a particular transcription 
factor. As used herein, "zinc finger protein" refers to known zinc finger proteins, or fragments 
thereof, or to novel polypeptides isolated by the methods of the invention. 

On page 63, the 3 rd complete paragraph was amended as follows: 

A second class comprises proteins in which the DNA-binding domain is comprised of 
multiple reiterated modules that cooperate to achieve high-affinity binding of DNA. An example 
is the Cys2His2 (SEQ ID NO: 4) class of zinc-finger proteins, which typically contain a tandem 
array of from two or three to dozens of zinc-finger modules. Each module contains an alpha- 
helix capable of contacting a three to five base-pair stretch of DNA. Typically, at least three 
zinc-fingers are required for high-affinity DNA binding. Therefore, one or two zinc-fingers 
constitute a low-affinity DNA-binding domain with suitable properties for use as a component in 
this invention. Examples of proteins of the C2H2 class include TFIILA, Zif268, Gli, and SRE- 
ZBP. (These and other proteins and DNA sequences referred to herein are well known in the art. 
Their sources and sequences are known.) 
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On page 78, the second (2 nd ) complete paragraph was amended as follows: 

In this report we describe the design and testing of an £.co/f-based selection method that 
can detect either protein-DNA or protein-protein interactions and that can handle libraries larger 
than 10 8 in size. We tested our new method by selecting Cys2His2 (SEP ID NO: 4) zinc finger 
variants similar to those previously isolated by phage display (6, 13). The results of our selection, 
the rapidity of our method, and the versatility of the underlying transcriptional activation scheme 
suggest that this bacterial-based system should provide a useful tool for identifying and 
characterizing protein-DNA and protein-protein interactions. 

On pages 78-79, the paragraph bridging pages 78-79 was amended as follows: 

The Gall lP-Zifl23 fusion protein contains residues 263-352 of the yeast Gall IP protein 
(with a N342V mutation [14]) fused by a nine amino acid linker Ala-Ala-Ala-Pro- Arg-Val-Arg- 
Thr-Gly (SEO ID NO: 5) to residues 327-421 of Zi£268 (the region encoding the three zinc 
fingers). The phagemid pBR-GP-Z123 expresses the Gall lP-Zifl23 hybrid protein from an 
IPTG-inducible /acUV5 promoter. The pBR-GP-Z12BbsI phagemid is analogous to pBR-GP- 
Z123 except that Zif finger 3 is replaced with a modified Zif finger 1 in which the sequence 
encoding residues -1 through 6 of the finger recognition helix is replaced by unrelated sequence 
(a "staffer" fragment) flanked by Bbsl restriction sites. All phagemids used in this study can be 
easily "rescued" from cells by infection with a filamentous helper phage; infectious phage 
particles produced by these cells contain single-stranded phagemid DNA. 

On page 79, the first (1 st ) complete paragraph was amended as follows: 

The reporter construct that expresses HIS3 (P zif -HIS3-aadA) has the Zif268 binding site 
sequence 5 GCGTGGGCG 3 centered at base pair -63 relative to the transcription start site of a 
weak E. coli lac promoter derivative (the P w k promoter). The three selection strain reporters 
change the zinc finger binding site of P^f- HIS3-aadA, replacing the sequence 
5 TCGACAAGCGTGGGCG 3 (SEO ID NO: 6) (bases -74 to -59 relative to the transcription 
start site) with sequences that should allow binding of the desired zinc finger variants: 
5 CAAGGGTTCAGGGGCG 3 (SEO ID NO: 7) (for NRE), 5 GGCTATAAAAGGGGCG 3 (SEO 
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ID NO: S) (for TATA), or 5 TGGGACATGTTGGGCG 3 (SEP ID NO: 9) (for p53). Each of 
these reporters was transferred (by recombination) to an F' episome encoding lacF repressor and 
then introduced into strain KJ1C in a single step essentially as previously described (15, IK J. & 
C.O.R, unpublished). The resulting strains were then each transformed with the pACL-aGal4 
plasmid to create the NRE, TATA, p53, and Zif "selection strains." 



On page 82, the first (1 st ) complete paragraph was amended as follows: 

Zinc finger domains can bind DNA and activate transcription in ExolL We tested 
our new iJ.co/z-based system by applying it to a problem previously studied using phage display: 
the selection, from a large randomized library, of zinc finger variants with altered DNA binding 
gj specificities (for review, see 21). However, before proceeding with selections, we first examined 
tfj whether a wild-type zinc finger protein could bind DNA and activate transcription in our system. 
M (Relatively little information was available on the activity of Cys2His2 (SEQ ID NO: 4) zinc 
finger proteins in bacteria.) To do this, we constructed fusion proteins containing fragments of 

Hi 

^ the yeast Gall IP and Gal4 proteins that had previously been shown to interact with each other 
flj (10, 14). Thus, we fused a Gall IP fragment to the three zinc fingers of the murine Zi£268 protein 
1^ (creating the Gall lP-Zifl23 protein), and we replaced the carboxy-terminal domain of the E.coli 
f| RNA polymerase a subunit with a Gal4 fragment (creating the chimeric <xGal4 protein). A 
?| Zif268 DNA binding site was positioned upstream of our P w it-HIS3-aadA operon to create the 

Pzif-HIS3-aadA operon (Figure 1C), and this cassette was introduced into a AhisB E.coli strain in 
flj single copy to create the "Zif reporter strain." 



On page 86, the first (1 st ) complete paragraph was amended as set forth below. Please note 
that the SEQ ID NOs have been added by virtue of this amendment All other underlining, 
i.e., NSGSHK and NHGSWK appearing in lines 9 and 10 of this paragraph, was part of the 
original text 

For the NRE target subsite, an initial attempt using our new selection method yielded 
only one finger (NSGSWK) fSEOIDNO: 10) that bound preferentially to the target sequence. 
Based on our existing knowledge of zinc finger-DNA recognition (reviewed in 21), one can 
postulate reasonable contacts between recognition helix residues of this finger and bases in the 
primary strand of the NRE subsite (Figure 3C). However, we were initially concerned by the 
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relatively low frequency of fingers selected for this site, and we repeated the selection using an 
additional enrichment step in an attempt to isolate more fingers. The great majority of sequences 
isolated this way had the same amino acid sequence as the candidate originally selected 
(NSGSWK) (SEOIDNO: 10) but two closely related sequences (NSGSHK (SEOIDNO: 11) 
and NHGSWK (SEOIDNO: 12)) were also identified. These results suggested that we might 
have obtained a small number of clones merely because very few candidates in our library can 
pass the threshold set in our NRE selection. 



On page 86, the second (2 n ) complete paragraph was amended as set forth below. Please 
note that the SEQ ID NOs have been added by virtue of this amendment. All other 
q underlining, i.e., not appearing in line 1 of this paragraph, was part of the original text. 

i 

l| As shown in Figure 3C, the sequences of fingers isolated in our NRE selections do not 

match the consensus sequence for fingers selected by phage display. We performed several 
H f experiments to explore the basis of this difference: We first checked our library by sequencing 
]*j random candidates to ensure that there was no drastic bias in nucleotide distribution and were 
s able to rule this out as a plausible explanation (unpublished data). We then decided to directly 
p introduce (in exactly the same context) one of the fingers (TRTNKS) (SEOIDNO: 13) that had 
?| been selected by phage display (6) and see whether it could work in our system as a Gall IP-zinc 
«f5 finger fusion protein. We find that NRE selection strain cells expressing this TRTNKS (SEQ ID 
?lj NO: 13) finger fusion protein grow very poorly on HIS selective medium whereas the same cells 
expressing the NSGSWK (SEP ID NO: 10) finger fusion (obtained in our selections) grow 
robustly (unpublished data). The simplest explanation for this result is that the TRTNKS (SEQ 
ID NO: 13) finger fusion binds poorly to the NRE subsite and therefore only weakly stimulates 
HIS3 expression. This explanation is supported by our observation that earlier selections with the 
NRE subsite, using a prototype of our system in which zinc fingers were expressed from a much 
higher copy number phagemid, had yielded the TRTNKS (SEOIDNO: 13) as well as the 
NSGSWK (SEOIDNO: 10) finger (J.KJ. and C.O.P., unpublished data). This suggests that our 
current system sets a very stringent standard for the NRE selections and may account for why we 
isolated such a small number of specific candidates. 
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On page 87, the first (1 st ) complete paragraph was amended as follows: 

We also used our binding site preference assay to compare the specificity of the 
NSGSWK (SEOIDNO: 10) finger we had selected for the NRE subsite with that of the 
TRTNKS (SEC) ID NO: 13) finger selected by phage display. In our bacterial-based assays, the 
NSGSWK (SEP ID NO: 10) finger binds specifically to the NRE subsite and binds only very 
weakly to the TATA subsite. By contrast, the TRTNKS (SEOIDNO: 13) finger binds only 
weakly to all four subsites (exhibiting a preference for the NRE and TATA subsites over the p53 
and Zif subsites) (unpublished data). These results suggest that the NSGSWK (SEOIDNO: 10) 
finger we selected actually binds more tightly and specifically in our system than the TRTNKS 
(SEOIDNO: 13) finger identified earlier by phage display. 

On page 87, the second (2 nd ) complete paragraph was amended as follows: 

Each of our three selections also yielded a small percentage of fingers that bind non- 
specifically to all four DNA subsites tested. Surprisingly, all of these fingers match a consensus 
sequence of the form R+WL+L (SEP ID NO: 14) (where + denotes a positively charged residue, 
Figure 3D). These fingers are rich in positive charge and may make extra phosphate contacts. 
We also note that all of these fingers have a tryptophan residue at position 2 and thus would not 
have been present in the libraries used for earlier phage display experiments. This highly 
conserved set of non-specific fingers raises many interesting questions: What level of specificity 
is required for a zinc finger protein to function in our assay (and thus to what extent does the E. 
coli chromosome function as a non-specific competitor)? How do these fingers bind? Why is this 
particular class of non-specific fingers the only type selected in our system? 

On page 94, the first (1 st ) complete paragraph was amended as set forth below. Please note 
that the SEQ ID NOs have been added by virtue of this amendment All other underlining, 
i.e., CXGGACACGTX and CGGG ANNNNN G appearing in lines 5 and 7 of this 
paragraph, was part of the original text. 

In addition to selecting proteins that bind to a specific DBS, this bacterial ITS can also be 
used to select DBS ? s that interact with a specific protein. Figure 9 shows the results for such an 
in vivo site selection to select DNA sequences that interact with the P53 zf protein. The consensus 
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binding site, as determined by Wolfe et. al., JMB 285, pl917-1934 (1999), for the P53 zf protein 
is CXGG ACACGTX (SEP ID NO: 15) where X indicates no clear sequence preference at that 
position. A library of EGFP reporter plasmids containing the partially randomized binding site 
CGGGANNNNNG (SEOIDNO: 16) was created (where N indicates a mixture of A, G, C, T) 
and introduced into host cells containing the a-Gal4 and Gall lp-PSS^ fusion proteins. These 
cells were then grown to saturation at 37°C in LB media with the appropriate antibiotics and then 
100 jxl of this culture was used to inoculate 10 ml of minimal media (as described in example 3) 
containing 10 ng/ml aTc and 100 jxM IPTG. These cultures were then incubated for 24 hours at 
30° C on a rotating drum incubator. After incubation, one round of FACS sorting was performed 
on a Cytomation MoFlo multiple laser FACS sorter and individual EGFP positive clones were 
selected. Of 20 clones analyzed, 16 were EGFP positive (i.e. expressed at least 2 fold more 
EGFP than control cells). These 16 positive clones contained three unique ¥53^ binding sites. 
The most abundant of these sites matched the consensus from the in vitro site selection. 
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Marked-iip version of claims showing changes made: 

Please amend claim 18 as set forth below: 

18 . The method of claim 16, wherein the weak DNA-binding domain comprises two 
Cys 2 His 2 (SEP ID NO: 4) zinc fingers. 
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